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PART  ONE 

RECOGNinON  OF  PARTIALLY  OCCLUDED  3-OlMENSIONAL  OBJECTS 
USING  INFORMATION  FROM  CONCAVITIES 

I  INTRODUCTION 

In  order  to  recognize  3-dimensioral  objects  in  2-diniensional  scenes  a  shape 
description,  conplete  enoqgh  to  determine  the  3'dimensional  object,  must  be 
recovered  from  the  scene.  Although  it  is  possible  to  recover  some  3-dimensionai 
information,  for  instance  shape  from  shade  [21]  or  shape  from  texture  [13],  in 
general  the  information  recovered  will  be  a  2-dimensional  representation  of  the 
object.  Mary  methods  are  available  for  interpreting  3-dimensional  objects  in  2- 
dimensionally  arrayed  range  images  [3],  [20];  however  this  is  considered  as  a 
somewhat  different  problem,  since  depth  information  can  be  derived  from  the  scene 
with  little  or  no  ambiguity.  The  represeitation  recovered  from  regular  2dimensional 
intensity  sceneswill  consist  of  a  mapping  from  an  object-centered  coordinate  space 
to  a  viewer^sntered  coordinate  space  [2].  This  mapping  represents  any  reorientation 
of  the  3-dimensional  object  within  its  coordinate  system  which  results  in  a  change 
vnth  regard  to  the  viewer. 

Currently  there  are  two  approaches  to  frarmir^  an  object  description  under  these 
conditions.  In  the  first  approach  [4],  a  3-dimensional  representation  of  all  the 
objects  to  be  recognized  is  stored.  Then  when  a  2-dlmensional  scene  is  to  be 
analyzed,  the  3-dimenslonal  model  is  transformed  to  obtain  its  2-dimensional 
equivalent.  The  corresponding  2-dimensional  versions  are  then  matched  against  those 
extracted  from  the  scene.  The  main  difficulty  with  this  approach  is  the  amount  of 
upfront  processing  required  to  comfxjte  an  adequate  number  of  transformations  to 
required  to  compute  an  adequate  number  of  transformations  to  represent  the  possible 
object  views. 

In  the  second  approach,  the  transformed  object  representations  are  calculated 
a-priori  and  the  corresponding  2-dimensional  descriptions  stored  for  the  scene 
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analysis  algorithm.  Obviously  the  2-dimensional  descriptions  must  be  compact 
enough  so  that  a  sufficient  number  of  views  can  be  stored  without  exceeding  memory 
restrictions.  Fourier  descriptors  [18], [19],  and  boundary  curvature  [26]  have  been 
used  to  describe  3dimensional  objects  in  this  manner  however,  they  are  inadequate 
when  conditions  of  occlusion  and  boundary  noise  respectively  are  allowed.  In  this 
paper,  a  method  for  3-dimensional  object  identificationsimilar  to  the  two  above  but 
with  much  greater  immunity  to  conditions  of  occlusion  and  boundary  noise  is 
presented.  The  method  is  believed  to  be  applicable  in  many  industrial  and  military 
systems  where  efficient  and  reliable  idaitification  is  necessary. 

In  section  II,  the  concepts  of  shape  space  and  shape  vector  are  introduced.  The 
formalization  of  these  concepts  provides  a  basis  upon  which  shape  contours  can  be 
analyzed,  even  in  the  context  of  incomplete  knowledge  (  occlusion  ).  The  underlying 
structure  for  constructir^  the  shape  vector  is  the  critical  point.  In  section  III  a  new 
method  for  defining  critical  points  is  described,  which  in  many  cases  offers 
improved  performance  versus  those  defined  on  the  basis  of  curvature.  In  section  IV  a 
procedure  for  extracting  these  new  critical  points  from  descrete  data  is  presented, 
and  in  section  V  the  corresponding  shape  identification  algorithm  or  cognitive  stage 
is  described.  Finally,  in  section  VI  the  procedures  ability  to  recognize  partially 
obscured,  complex  3-dimensional  objects  in  2-dimensional  scenes  is  demonstrated. 

II  SHAPE  SPACE  AND  FEATURE  VECTORS 

In  order  to  distinguish  one  shape  from  another,  or  a  partial  shape  as  a  part  of  a 
whole  shape,  some  mechanism  which  examines  the  relationship  between  shape 
measurements  is  needed.  One  such  mechanism  which  is  ideal  for  this  application  is 
a  shape  space  [24].  By  introducing  a  shape  space  defined  on  the  measurements, 
relationships  between  these  measurements  can  be  quantified  and  thereby  compared. 


The  basic  element  of  shape  space  is  the  shape  vector  ^Z,  where  j  indicates  the  jth 

set  of  measurements  from  the  kth  shape.  If  there  are  K  shapes  to  be  analyzed,  then  k 

is  integer  valued  from  1  to  K.  The  range  of  j  depends  on  the  total  number  of 

measurements  possible  on  a  given  shape  k.  The  shape  vector  consists  of  a 

measurement  vector,  normalized  by  a  scalar  factor  referred  to  as  the  size 

J 

variable.  Thus, 


h  =  ^ 

J  J  J 


The  measurement  vector  consists  of  I  measurements  between  predetermined  points 
on  the  shape  j^S,  represented  notationally  by 


. jS ) 


Combining  (1)  and  (2)  yields  the  final  form  of  the  shape  vector 
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For  notational  purposes  it  is  convenient  to  rewrite  (3)  as 

—  I  k  k  k  k  ,  . 

. . ^ 

where  each  ^z.  represents  the  ith  measurement  component  normalized  by  the  size 
variable.  When  constructing  the  shape  vector  two  important  properties  must  be 
preserved:  1)  all  measurements  are  made  between  a  set  C  of  predefined  points,  2) 


the  shape  vector  is  independent  of  the  size  variable.  If  these  conditions  are  met,  then 
any  two  shapes  which  have  the  same  shape  vector  are  said  to  be  equal  with  respect 
to  the  measurements. 


The  K^asurement  Vector 

The  exact  nature  of  the  measurement  vector  is  described  next.  Let  each  element  of 

C  be  represented  by  the  ordered  pair  where  and  y^  are  the  spatial 

2 

coordinates  in  R  .  The  displacement  vector  between  any  critical  point  c^  and  any 
reference  point  c’=(x’,y’)  is  given  as 


+  =  (x„tx’,ynty’)  5 

2 

the  normal  Euclidean  definition  for  displacement  in  R  .The  resulting  vector  yields  a 
displacement  with  respect  to  the  origin.  With  this  definition  in  mind,  consider  I 
arbitrary  critical  points  which  constitute  the  jth  measurement  vector  formed  from 
C.  Now  form  a  reference  point,  calculating  the  centroid  of  the  I 

points  as 


1  I 

”  T  ^  ’ 


=  Tl>^ 


Then  form  the  components  of  the  measurement  vector  by  calculating  the  displacement 
between  each  critical  point  and  the  feature  centroid.  Mathematically  this  is  given  by 


m.  =  (  .x.  -  x  ,  .V 
J  J  J  1  J  c  ’ 


i=l  to  I . 


This  in  effect  translates  all  measurements  to  the  origin  of  R^,  in  other  words,  the 


2 

measurement  vector  is  an  I-tuple  of  vectors  originating  from  the  origin  of  R  .  The 
relationships  described  by  (6)  and  (7)  are  demostrated  graphically  in  figure  1 . 

Rotation,  Translation  and  the  Measurement  Vector 

It  is  a  desirable  goal  to  remove  the  effects  of  rotation,  translation  and  scaling 
from  the  shape  representation  or  to  at  least  account  for  them.  An  examination  of 
these  effects  for  the  first  two  transformations  on  the  measurement  vector  is  now 
discussed  (the  effects  of  scaling  are  closely  related  to  the  definition  of  the  size 
variable  and  will  be  described  later). 

Since  the  rotation  Rg,  of  the  shape  by  an  angle  6  is  linear  [10],  the  rotation  of  the 
arbitrary  set  (cp  C2,  c^,  ...  ,  Cj}  by  an  angle  0  about  the  origin  results  in  a 
rotation  of  the  m.’s  about  the  origin  plus  a  displacement  to  the  centroid  of  the 
rotated  feature.  This  can  be  shown  mathematically  as 


where  the  latter  two  rotations  are  about  the  origin.  The  latter  displacement  term  in 
(8)  will  be  factored  out  by  the  subtraction  in  (7) .  Thus  the  angle  of  rotation  between 
the  rotated  measurement  vector  and  the  original  measurement  vector  can  be 
calculated  for  any  component  i  as 


where  (•,•)  is  the  standard  inner  product  and  11*11  is  the  standard  norm  for  vectors 


2 

in  R  .  From  this  it  can  be  seen  that  the  only  difference  between  measurement 
vectors  from  the  rotated  set  of  critical  points  and  the  unrotated  set  is  a  rotation  9 
about  the  origin. 

Similarly,  a  translation  T,  of  the  set  (cp  C2,  c^,  ...  ,  Cj}  by  a  vector  c^^  can  be 
written  as 

•c.  =  r  ■  ;C.  1  =  T I  .m.  +.C  i  =  .m.  +  .c  +  c,  10 

J  1  U  1  i  Ij  ‘  J  i  J  1  J  t 

However,  is  just  a  new  centroid  which  will  be  factored  out  in  (7)  leaving  only 
the  original  m.’s.  Therefore,  translation  has  no  effect  on  the  measurement  vector. 

The  Size  Variable  and  the  Effects  of  Scalir^ 

The  second  condition  required  for  tte  formulation  of  the  shape  vector  is  that  the 
shape  vector  mist  be  independent  of  the  size  variable.  This  can  be  achieved  by 
makir^  the  size  variable  a  function  of  the  measurement  vector.  One  such  fuction  can 
be  formulated  as 

f  =  2  11  II 

J  i=l  J  * 

This  formulation  can  be  interpretted  as  the  spatial  energy  contained  in  the 
measurement  vector.  With  this  value  as  the  size  variable  the  sum  of  all  spatial 
energy  in  any  shape  vector  always  equals  1. 

As  a  result,  the  effects  of  scale  change  can  be  removed.  As  an  example,  consider 
the  previously  mentioned  set  of  arbitrary  points.  If  they  are  scaled  by  a  factor  a, 
their  measurement  vector  will  become 


12 


•M’  =  (  a.m,  ,  a.niT  ,  a.m-,  ,  ot.m^  ,  a. me  ,  a  mr 

J  Jij2’j3’j4’j5’j6 


which  has  total  energy  given  by 


all  2  I 

i=l  J  ‘ 


This  is  equivalent  to  the  size  variable  for  multiplied  by  ot.  When  the 


corresponding  shape  (feature)  vector  is  calculated  the  scale  factor  a  will  be  canceled 


by  the  size  variable. 


GENERAUZED  DESCRIPTION  FOR  IRREGULAR  2-DIMENSIONAL  SHAPES 


Since  initially  only  2-dimensional  shapes  are  considered  the  domain  for  a  shape, 

It _ I  O  _ _ I  I_n2  TT _ _ O _ _  I _ I_J  _ -t _ I 


called  S,  is  restricted  toR  .  The  shape  S  consists  of  a  bounded,  simple,  closed 
curve  B  [22]  and  the  area  enclosed  by  this  curve,  called  the  interior  1.  In  other 
words,  S=Bni.  The  shape  description  will  consist  of  a  finite  set  of  measurements 
extracted  from  S.  These  measurements  will  be  made  between  a  set  of  points  C, 


which  are  referred  to  as  critical  points.  The  set  of  critical  points  is  a  subset  of  B. 


In  order  to  define  this  set  C,  another  structure  called  a  concavity  tree  must  first  be 


described. 


According  to  Sklansky  [24],  the  concavity  tree  is  a  structure  for  describing 
simply  connected  silhouettes.  If  the  shape  S  is  convex  ,  then  the  concavity  tree  is 
trivially  the  shape  S  itself.  In  this  work,  it  is  assumed  that  the  shape  is  non-convex. 


this  will  be  the  case  in  most  applications  where  irregular  objects  are  to  be 


processed.  The  concavity  tree  is  described  for  the  shape  S  by  the  following 


k* 
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procedure. 

1)  Compute  the  convex  hull  of  the  shape  S,  called  This  is  defined  operationally 


14 

The  convex  hull  is  the  smallest  convex  set  which  contains  S. 

0 

2)  Subtract  the  original  shape  S  from  its  convex  hull  S  .  This  is  equivalent  to 

c  ~ 

finding  the  intersection  of  S  with  the  complement  of  S  (S).  The  set  formed  by 
S  ns  is  most  likely  a  disconnected  subset  of  S  .  Consider  each  connected  element  of 
this  subset  as  a  branch  of  the  concavity  tree  (subshape).  Denote  each  branch  by 
which  indicates  the  ith  component  of  S  OS.  Each  S.  will  have  the  same  properties  as 


3)  To  find  the  next  level  of  the  concavity  tree,  replace  S  by  S.  and  S.  by  S^j. 
Repeat  the  above  procedure  for  each  level  1  branch  of  the  concavity  tree.  The  index  j 
will  denote  the  branches  of  the  concavity  tree  at  level  2. 

4)  To  complete  the  concavity  tree,  continue  with  the  above  procedure  progressing 

from  (S.)  to  {S. .}  to  (S. }  ect.  until  all  residual  subshapes  from  the  second  step  are 
i  IJ  IJK 

convex.  The  set  of  shapes  (S.,  S. S. ...}  for  all  i,j,k,...  forms  the  concavity 
tree.  Figure  2  shows  an  arbitrary  shape  S  and  its  resulting  concavity  tree. 

Since  the  concavity  tree  is  based  on  the  relationship  between  a  shape  (subshape) 
and  its  convex  hull  there  is  only  one  unique  tree  for  each  shape  (subshape).  However 
it  is  possible  for  this  tree  to  have  a  countably  infinite  num'-ar  of  levels  and  an 
uncountable  infinite  number  of  branches,  for  example  see  Koch  curves  [65].  In  this 
work  the  class  of  shapes  examined  will  be  resticted  to  those  which  are  sufficiently 
smooth  as  to  have  only  a  finite  number  of  levels  and  branches,  and  thereby  a  finite 
number  of  critical  points.  This  restiction  still  includes  all  shapes  found  in  real 
applications  and  all  but  the  most  abstract  found  in  any  application. 

Now  that  the  concavity  tree  has  been  described,  the  definition  for  the  critical 


A 

S 


points  can  be  considered.  Let  the  set  C  represent  the  critical  points  for  a  shape  S  and 
'!'(•)  the  operation  which  yields  C  from  S, 


C  =  'I'(S). 
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The  set  C  is  equivalent  to  the  union  of  all  critical  points  extracted  from  all 
branches  of  the  concavity  tree  of  S,  i.e., 


C=  U  C 
all  levels  ^ 


16 


where, 


C.  =  U  'K  S  ), 
all  branches 


C2=  U  'I'lS  ), 
all  branches 

j 

all  branches  J 
k 

etc... 
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The  operation  for  any  shape  (subshape)  must  now  be  described.  Consider  any 
branch  of  the  concavity  tree  (S. )  and  find  its  intersection  with  the  boundary  B,  so 

ijK... 

that 


B..,  =Bns... 

ijk...  ijk... 
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This  intersection  defines  a  line  segment  for  each  branch  of  the  concavity  tree.  The 

9 


collection  of  all  such  line  segments  formed  at  each  level  of  the  tree  is  the  basis  for 
the  critical  point  sets  C^.  The  critical  points  are  finally  defined  as  the  endpoints  of 
the  B. ’s.  Figure  3  illustrates  the  boundary  segments  and  critical  points  for  the 
first  level  concavities  of  the  shape  in  figure  2.  Finally,  the  operation  '!'(•)  can  be 
summed  up  by  the  following  hierachy  of  point  set  relationships. 


C  =  'I'(S). 


U  C„, 


all  levels 


^  I  ^  '^^^iik  m’f' 

all  levels  '■  all  branches  ^  ^ 


all  levels  ^  all  branches 


U  {endpointsIB  I,  ]}}, 

r^ifV'hoc  v*  j  J  J 


in  i(  inl^  . 

all  levels  '■  all  branches j  jj 


These  relationships,  between  all  allowable  shapes  S  with  their  boundaries  and 
their  concavity  trees,  are  unique  and  well  defined.  Therefore,  the  relationships 
between  any  shape  S  and  its  corresponding  critical  pint  set  C  are  also  unique  and 
well  defined. 

Although  these  relationships  are  unique  and  well  defined,  their  exists  two  special 

cases  which  can  lead  to  some  ambiguity  if  not  handled  properly.  The  first  case 

occurs  when  two  branches  from  the  same  level  of  the  concavity  tree,  S. .  and 

'  ij...mn 

Sjj  have  the  following  proprty: 


S. .  ns..  *  =  a  single  point 

ij...mn  ij...mn  ^  ^ 


but  the  intersection  of  the  open  sets  formed  by  S. .  _  -  S. .  ’is  empty.  When 
this  happens  the  subsecjent  boundary  segments  associated  with  the  two  branches  of 
the  concavity  tree,  and  S.^  will  in  fact  be  a  single  line  segment.  To 

remove  any  ambiguity,  the  single  point  described  in  (20)  will  be  used  to  partition 
this  line  segment  into  two  seperate  ones.  This  sir^le  point  will  be  called  "critical" 
and  will  be  considered  as  two  seperate  points,  each  with  singular  associations  to 
S. .  and  S. .  ’  respectively. 

The  second  case  is  similar  to  the  first  except  involves  an  overlapping  of  critical 

points  from  different  levels  of  the  concavity  tree.  In  other  words,  the  sets  are  not 

mutually  exclusive.  This  will  not  be  a  problem  as  lorg  as  the  membership 

2 

information  within  the  C^’s  is  retained  along  with  spatial  information  in  R  , 

2 

Consequently,  the  location  of  critical  points  within  R  and  the  knowledge  of  the 
levels  and  branches  of  the  concavity  tree  from  which  they  originated  are  of  equal 
importance. 

As  a  rule  of  thumb,  each  branch  of  the  concavity  tree  will  generate  two  critical 

points.  Each  of  these  critical  points  can  be  represented  by  a  triplet  (x,y,l),  where  x 

2 

and  y  represent  the  location  of  the  points  in  R  and  1  indicates  the  level  of  the 
concavity  tree  at  which  the  points  were  generated;  x  and  y  are  real  valued  numbers;  1 
is  integer  valued. 


2 

Since  initially,  only  shapes  which  are  invariant  in  R  are  allowed  three 
transformations  must  be  considered.  The  three  transformations  -  scaling,  rotation 
and  translation  -  are  all  well  known  mappings  from  R  to  R  ,  and  are  represented 
symbolically  by  R0(S)  and  7(3)  respectively.  They  are  defined 

mathematically  in  the  following.  The  functions  S^(S),  RglS),  7(S)  map  a  shape  S 


into  a  shape  S’  such  that  all  points  {x,y),  elements  of  S  are  mapped  to  points  in  S’ 
by 


S 


a 


(x,y)  1  =  (  orx  ,  ay )  =  (  x’ ,  y’ ), 
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(x,y)  1  =  (  X  COS0  -  y  sin0  ,  x  sin0  +  y  cos0  )  =  (  x’  ,  y’ ) , 


T  |(x,y)  ■  =  (  X  +  x^ ,  y  +  y^ )  =  (  x’  ,  y’ ), 
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where  a,  0,  x^,  y^  6  R,  (x,y)  6  S  C  R^,  (x’,y’)  6  S’  C  R^. 

If  a  is  restricted  to  beir^  positive  and  finite,  0  to  beir^  greater  than  zero  and  less 
than  two  pi,  and  x^,  y^  to  being  finite,  then  all  three  mappings  are  not  only  onto  but 
one-to-one  [10]. 

Using  the  definitions  for  the  transforms  of  scale,  rotation  and  translation  the 
following  theorem  can  be  formulated. 

Theorem:  the  set  of  critical  points  C  obtained  for  an  arbitrary  shape  S  which  has 
undergone  one  of  the  transformations  S^,  Rg  or  T  is  identical  to  the  set  of  critical 
points  for  the  mtransformed  shape,  subjected  to  the  same  transform.  This  can  be 
stated  mathematically  as: 


'KS^(S))  =  S^('I'(S)), 

'I' (  ^0  ( S ))  =  Rq  ( vP  ( S )), 

'i'(r{S))  =  r('KS)).  24 


The  proof  of  this  theorem  appears  in  the  appendix. 


In  general  compositions  of  and  !(•)  are  not  commutative  because  of 

the  nonlinearity  of  T{*)\  therefore  their  order  must  be  preserved.  However, 
combinations  involving  only  S^(«)  and  Rq(*)  are  commutative  [10]. 


The  next  question  to  be  addressed  involves  the  invariance  of  critic-!  point 

determination  when  only  partial  shape  information  is  available,  this  problem  can  be 

divided  into  two  simpler  and  similar  type  problems,  namely,  truncated  shape 

representation  and  composite  shape  representation.  In  the  first  case,  it  is  desirable 

to  determine  a  source  shape  S  from  a  given  shape  S’  where  S’CS  and  B’nB=0.  In  the 

second  case,  it  is  desirable  to  determine  two  or  more  source  shapes  .S  from  S’ 

where  S’=  U  .S  and  B’n.B=0  for  all  i. 
all  i  *  ‘ 

In  order  to  accomplish  either  of  the  above  task  it  is  necessary  for  a  subset  of 
critical  points  from  each  source  shape  to  remain  unaltered  in  S’.  Since  the  critical 
points  are  findamentally  linked  to  the  cwicavity  tree  of  the  source  shape,  this  is 
equivalent  to  requiring  that  a  portion  of  the  concavity  tree  remain  unaltered. 

In  the  case  of  truncation,  the  portion  of  the  concavity  tree  which  is  distorted  can 
be  identified  by  examining  the  convex  hulls  of  the  whole  and  partial  shapes,  or  S  and 
S’  respectively.  Those  subshapes  or  branches  of  the  concavity  tree  which  do  not 
intersect  a  portion  of  (S^-S^ )  remain  inaltered.  An  example  of  this  relationship  is 
illustrated  in  figure  4  for  an  arbitrary  shape  S  and  subshape  S’.  From  this  it  can 
be  seen  that  each  branch  of  the  concavity  tree  acts  as  a  generating  node  for  all 
subsequent  branches  in  that  portion  of  the  tree.  Any  node  which  remains  unaltered 
will  generate  sub-level  branches  identical  to  those  of  the  original  tree.  A  comparison 
between  the  branches  of  the  original  shape  S  from  figure  4  and  its  corresponding 
truncated  shape  S’  is  shown  in  figure  5.  The  branches  inscribed  by  squares  are 


identical  in  both  trees. 

In  the  case  of  composite  shapes,  the  effects  are  more  complicated.  Not  only  are 
branches  of  the  original  tree  truncated,  but  new  branches  may  be  created  and  others 
augmented.  This  occurs  because  1)  the  uiion  of  the  convex  hulls  of  the  combinig 
shape  is  not  necessarily  convex,  thereby  creating  new  concavities  and  2)  concavities 
from  one  shape  may  overlay  interior  areas  of  another.  These  possibilities  are 
illustrated  in  figure  6  for  a  composition  of  the  shape  used  previously  in  figure 
4. 

In  general  any  branch  of  the  concavity  tree  which  intersects  a  portion  of  the 

overlap  region  or  intersects  a  concavity  \>^ich  includes  the  difference  between  S 

and  U  .S^  will  be  distorted.  In  certain  cases  it  is  possible  for  certain  branches,  a 
alli‘ 

subset  of  the  distorted  portion  of  the  tree,  to  remain  unaltered.  However  the 
conditions  mder  which  this  occurs  are  complex  and  specific  to  the  particular  shape. 

In  the  next  section,  the  gap  between  the  theoretical  shape  description  developed  in 
the  previous  section  and  a  discrete  implementation  for  digitized  shapes  is  bridged. 
The  effects  of  finite  bouidary  representation,  quantization  noise  and  additive 
observation  noise  are  also  considered. 

Ill  CRITICAL  POINTS  FROM  REAL  DATA 

The  definition  of  critical  points  advanced  in  the  previous  section  can  be  restated  as 
follows:  critical  points  are  the  endpoints  of  the  line  segments  formed  by  the 
intersection  of  a  shape’s  boundary  B  with  the  branches  (subshapes)  of  its  concavity 
tree.  If  the  shape  boundary  is  continuous  and  can  be  observed  without  error,  then 
these  critical  points  can  be  exactly  determined  for  all  combinations  of  scaling, 
rotation  and  translation.  However,  in  the  discrete  case  the  shape  is  usually 
represented  by  a  finite,  integer-valued  sequence.  This  sequence  is  almost  always  in 


the  form  of  a  chain  code  or  a  thinned  boundary  code  [12].  In  this  work  it  is  assumed 
that  the  sampled  boundary  has  been  properly  preprocessed  so  that  it  meets  the 
discrete  definitions  of  being  closed  arid  single-connected.  These  definitions  are 
described  in  [22]  along  with  various  techniques  for  processing  images  which 
guarantee  that  these  conditions  are  met. 

With  these  requirements  in  mind,  let  the  processed  shape  be  represented  by  the 
sequence  '*^ich  corresponds  to  the  quantized  estimate  of  the  shape 

boundary.  The  coordinates  of  the  sequence,  x.  and  y.,  should  be  eight-connected  and 
integer-valued.  Because  only  boundary  information  is  available  the  previous  definition 
for  critical  points  will  have  to  be  reinterpreted. 

The  general  procedure  for  extractir^  critical  points  is  illustrated  by  the  flow  chart 
in  figure  7.  At  the  front  end  of  the  system  is  a  spatial  filter  which  is  chosen  to 
minimize  the  effects  of  the  quantization  and  observation  noises.  At  the  next  stage, 
the  convex  hull  information  about  the  shape  is  obtained.  A  decision  process  is  then 
required  to  identify  and  confirm  critical  points.  Depending  on  the  output  of  the 
decision  stage,  the  shape  is  either  segmented  into  the  next  level  of  branches  of  its 
concavity  tree  or  the  procedure  is  terminated.  This  procedure  is  recursive,  with  each 
pass  around  the  loop  corresponding  to  the  processing  of  one  level  of  the  concavity 
tree. 

The  purpose  of  the  spatial  filter  is  to  reduce  the  effects  of  the  quantization  noise 
and  the  additive  noise  in  the  boindary  sequence.  The  type  of  filter  chosen  should  be 
optimized  for  the  statistics  of  the  noise  processes  and  the  source  shapes  [1]. 
In  general  these  statistics  depend  on  the  imaging  equipment  and  the  set  of  source 
shapes,  and  cannot  be  determined  globally.  In  fact,  they  are  most  likely  nonstalionary 
random  processes.  On  the  other  hand,  if  the  filter  is  made  robust,  good  results  can 
still  be  obtained.  Therefore,  an  averagir^  filter  is  chosen. 

The  output  of  the  filter  will  be  a  real-valued  sequence  (x^,y^}|lj,  with  the  same 
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number  of  elements  as  the  input  sequence.  By  increasing  the  length  of  the  filter 
window  the  effects  of  the  noise  can  be  made  arbitrarily  small.  However,  the 
smoothir^  effects  may  cause  smaller  features  of  the  shape  to  be  eliminated.  In  many 
cases,  this  may  be  considered  advantageous  because  a  simultaneous  reduction  in 
feature  processing  and  increased  noise  immunity  is  obtained. 

The  next  step  is  to  find  those  points  from  the  sequence  which  are 

elements  of  the  boundary  of  the  con^«x  hull  of  the  digitized  version  of  S.  This 
boundary  is  equivalent  to  the  Minimum  Perimeter  Convex  Polygon  (MPCP)  enclosing 
S.  It  can  be  calculated  using  a  modified  version  of  the  routine  described  by  Fr'eeman 
and  Shapira  [7].  This  modified  version  is  adapted  to  process  real  data  instead  of 
integer  data.  It  requires  a  longer  running  time  but  still  iterates  to  a  final  solution  in 
all  cases.  The  actual  convex  hull  is  not  required,  only  the  perpendicular  distance 
from  each  point  of  the  shape  boundary  to  the  MCPC.  This  distance  is  referred  to  as 
the  difference  sequence 


The  procedure  for  determinir^  the  critical  points  from  the  difference  sequence  is 
now  described.  As  a  first  step,  an  interpretation  of  the  definition  of  critical  pints 
must  be  aplied  in  the  context  of  the  difTerence  sequence.  Since  the  difference 
sequence  measures  the  distance  between  the  boundary  of  S  and  the  boundary  of  S  ,  it 
can  be  used  to  find  the  intersection  defined  in  (18).  This  intersection  for  the 
discrete  case  correspnds  to  those  pints  of  which  pual  zero.  The  endpoints 

(critical  pints)  of  the  associated  line  segments  are  just  those  pints  where  the 
difference  sequence  goes  from  a  zero  value  to  a  nonzero  value.  If  the  shape  boundary 
is  observable  without  error,  then  the  critical  pints  can  be  determined  exactly; 
however,  this  is  not  the  case  when  aplied  to  any  discrete  data.  Therefore,  the 
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critical  points  will  be  determined  by  finding  certain  transition  regions  in  the 

difference  sequence.  This  will  compensate  for  most  of  the  errors  due  to  observation 

noise  as  well  as  errors  incured  by  the  resampling  of  shapes  which  have  undergone 

one  or  more  of  the  transformations  discussed  previously.  Of  these  'ee,  only  the 

transformation  of  scale  seriously  effects  the  difference  sequence  (the  maximum 

error  resultir^  from  the  other  two  transformations  is  limited  to  2/V2’  times  the 

quantization  grid  size).  Since  the  sptial  energy  contained  in  the  shape  is  directly 

proportional  to  the  area  of  the  quantization  grid  which  it  covers,  any  reduction  in 

2 

scale  by  a  factor  a  results  in  a  simultaneous  reduction  in  signal  to  noise  ratio  of  a  . 
Because  of  this,  certain  compromises  must  be  made  in  the  performance  of  the 
critical  point  extraction  algorithm,  specifically,  it  will  be  designed  to  operate  over  a 
limited  range  of  scale  factors.  To  be  more  precise,  the  resulting  procedure  tries  to 
predict  the  behavior  of  the  critical  points  over  the  design  range  of  scale  factors.  This 
reduces  to  a  more  basic  problem  of  distinguishing  concavities  formed  by  the  noise 
processes  from  those  concavities  due  to  the  shape  structure.  The  exact  procedure  for 
this  is  described  next. 

The  input  shape  for  extracting  critical  points  for  the  shape  dictionary  is  evaluated 
at  a  equal  to  one  (the  maximum  scale  factor).  This  corresponds  to  the  highest  signal 
to  noise  ratio  expected  for  the  shape.  The  first  step  is  to  isolate  a  set  of  potential 
critical  points.  This  is  accomplished  by  comparing  the  difference  sequence  to  a 
threshold,  r^.  When  the  absolute  value  of  the  difference  sequence  is  greater  than 
a  concavity  is  formed.  The  points  inside  the  interval  [0,rj]  closest  to  the  position 
where  the  threshold  is  crossed  are  considered  as  potential  critical  points  and 
added  to  the  critical  point  list.  These  points  are  in  direct  correspondence  with  the 
endpoints  of  the  line  segments  in  (18).  They  determine  certain  subsequences  of 
(d.)._j  called  concavity  sequences.  The  concavity  sequences  (d.)  are  the  sequential 
portions  of  which  are  greater  than  r^.  Associated  with  each  concavity 


■t  «■  .» 


sequence  are  a  pair  of  critical  points,  one  at  each  end.  The  first  threshold  simply 
identifies  possible  critical  points.  A  second  threshold  is  used  to  evaluate  the  most 


probable  underlying  cause  of  a  concavity. 


The  second  threshold  T2,  which  is  greater  than  r^is  used  to  eliminate  critical 
points  associated  with  small  concavitities  from  the  list.  It  also  helps  remove  most 


concavities  which  are  formed  strictly  from  observation  noise.  For  each  sequence  of 


c  c 

points  within  a  concavity,  (d.},  the  maximum  absolute  value  for  the  d.’s  are  found. 


Q 

If  this  value,  called  d  ,  is  less  than  then  the  critical  points  associated  with 
rn3x  z 

the  concavity  in  question  are  eliminated  from  the  critical  points  list.  Since  it  is 


impossible  to  distinguish  small  shape  corx;avities  from  those  produced  by  noise  with 


just  one  sample,  the  alternative  is  to  eliminate  them  all.  The  reasoning  being  that 


the  total  number  of  shape  concavities  eliminated  will  be  small  compared  to  the 


number  of  noise  produced  concavities.  This  can  be  justified  theoretically  by  assuming 


that  the  average  maximum  depth  of  the  actual  shape  concavities  is  much  greater  than 
the  maximum  depth  of  the  noise  concavities.  This  is  illustrated  in  figure  8  for  a 
generalized  representative  shape.  This  type  of  decision  is  analogous  to  the  classical 
maximum-likelihood  decision  criteria  [16].  Note  that  the  depth  of  the  true 
concavities  can  be  considered  as  the  signal  energy,  and  the  depth  of  the  noise  related 
concavities  as  the  noise  energy.  From  figure  8  it  can  be  seen  that  for  appropriate 


thresholds  r  ^  and  T2  most  of  the  effects  of  the  noise  can  be  eliminated,  provided  the 
signal  to  noise  ratio  is  sufficient.  The  matching  algorithm,  to  be  described  later  in 


this  chapter,  is  designed  to  be  very  tolerant  of  missed  critical  points  as  long  as  they 
are  listed  in  the  dictionary.  However,  it  is  susceptible  to  error  if  unexpected  critical 


points  are  fomd  in  the  problem  text.  Therefore,  the  threshold  r2  is  biased  by  an 
amoirt  when  the  shape  dictionary  is  being  prepared.  The  direction  of  the  bias  is 
in  the  direction  of  higher  false  alarm  rates  for  concavities  due  to  noise. 


Now  that  the  majority  of  potential  critical  points  have  been  isolated,  it  is 
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necessary  to  find  a  method  of  predicting  their  behavior  in  the  problem  text  over  the 
design  range  of  scale  factors.  The  effects  of  changes  in  a  on  the  {d't/’s  falls  Into  two 
categories:  1)  concavities  found  at  the  maximum  value  of  a  may  collapse  below  the 
t2  threshold  and  their  associated  critical  points  eliminated  from  the  problem  text; 
2)  segments  of  the  difference  sequence,  which  are  outside  the  interval  [0,r2]  for 
large  a,  may  cross  the  r2  threshold  and  create  new  critical  points.  Of  these  two 
possibilities,  the  first  case  has  been  observed  to  happen  most  frequently.  This  can 
be  explained  as  follows.  Because  the  depth  of  a  concavity  is  determined  from  two 
extremal  points  of  opposite  direction,  namely,  the  outer  most  points  on  the  shape 
boundary  contained  in  the  convex  hull  and  the  inner  most  shape  pints  correspnding 
to  the  maximum  depth  of  a  concavity,  the  smoothing  effect  of  the  spatial  filter 
converges  these  two  pints  approximately  twice  as  fast  as  distances  based  on  only 
one  of  the  extremal  pints,  i.e.  case  two. 

In  order  to  predict  the  occurrence  of  these  two  changes,  two  additional  thresholds 
are  introduced.  Their  purpose  is  to  assign  a  measure  to  regions  of  the  difference 
sequence  which  may  cause  one  of  the  above  charges.  This  measure,  called  the 
confidence  number,  will  be  assigned  individually  to  each  critical  pint  in  the  shape 
dictionary.  The  confidence  number  indicates  the  prcentage  of  the  scale  factor  range 
in  which  a  prticular  critical  pint  is  detectable  as  well  as  the  order  in  which  it 
enters  (or  leaves)  the  problem  text. 

The  first  of  these  two  thresholds,  r^,  is  used  to  identify  candidate  critical  pints 
which  may  be  detected  in  the  problem  text  for  smaller  values  of  or.  If  any  spuential 
prtion  of  (d’r)  for  any  concavity  has  the  proprty  that 


^2  <  M|  I  ^  ^3 
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for  all  d.  in  that  group,  then  a  ptential  critical  pint  is  indicated.  The  two  points 
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where  the  sequence  {d^}  indicated  by  (25)  crosses  into  the  interval 
to  the  shape  dictionary  and  assigned  a  confidence  number,  given  by 
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where  d^^.^  is  the  minimum  absolute  value  of  the  difference  sequence  in  region 
[r2,r2].  An  example  of  this  situation  is  illustrated  in  part  a  of  figure  9. 

The  second  new  threshold,  r^,  is  used  to  measure  the  proportion  of  scale  factors 
at  which  a  particular  concavity,  and  its  associated  critical  points,  can  be  detected. 

For  each  concavity  sequence  with  d  greater  than  r.,  assign  a  confidence  number 

niaX  u 

according  to  the  following: 


“■  'Lx  >  *c  = 


else 
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This  situation  is  illustrated  in  part  b  of  figure  9.  Note  that  is  generally  twice 
as  large  as  and  that  ^3  )  ^2  ^  ^1‘ 

At  this  point  the  algorithm  for  extracting  critical  points  and  assignment  of 
confidence  numbers  for  a  single  level  of  the  concavity  tree  has  been  described.  To 
extend  the  procedure  to  the  rest  of  the  concavity  tree,  the  recursive  nature  of  the 
algorithm  must  be  defined.  Since  the  concavity  sequences  correspond  to  the  next  level 
of  concavity  tree  branches,  they  can  be  processed  individually  in  the  same  manner  as 
the  original  shape.  The  only  modification  required  is  the  reversal  of  the  value  for 
s  in  (9)  at  each  new  level.  New  critical  points  are  added  to  the  shape  dictionary  as 
previously  described.  Confidence  numbers  are  assigned  using  the  same  criteria  as 
before  with  the  following  exception:  concavities  which  have  confidence  numbers  less 


than  1,  have  all  sublevel  critical  points  assigned  a  confidence  number  calculated  as  a 
product  of  the  current  level  confidence  numbers  with  the  values  determined  according 
to  (26)  and  (27).  The  process  is  propogated  through  the  concavity  tree  until  no  new 
points  are  found.  Figure  10  and  11  show  the  difference  sequence  and  critical  points 
respectively  from  an  arbitrary  shape  for  levels  1,  2  and  3  of  the  concavity  tree 
respectively.The  resulting  dictionary  page  contains  the  following  information 
extracted  sequentially  from  the  shape  boundary  for  each  critical  point:  1)  its  x  and  y 
coordinate  values,  2)  its  confidence  number,  3)  the  level  of  the  concavity  tree  at 
which  it  was  found. 

Shape  Dictionary  Calculation  for  3-Dimensional  Objects 

Consider  an  arbitrary,  rigid  object  as  shown  in  figure  12,  where  9,0  and  0 

X  y 

describe  angles  of  rotation  in  the  three  axes  of  the  rectangular,  object -centered 
coordinate  system.  The  z-axis  of  the  object-centered  coordinate  system  is  restricted 
to  parallel  alignments  with  the  z’-axis  of  the  viewer-centered  coordinate  system. 
This  restriction  results  in  the  following  interpretations  for  motion  attributed  to  the 
object.  Tiie  imaging  plane  shown  in  figure  12  is  the  space  in  which  the  object  is 
projected 

1)  Translation  of  the  object  (  or  the  object-centered  coordinate  system  )  with 
respect  to  the  z’-axis  results  in  a  scale  change  in  the  imaging  plane. 

2)  Translation  of  the  object  {  or  object-centered  coordinate  system  )  along  either 
or  both  of  the  x’  and  y’  axes  results  in  a  translation  in  the  imaging  plane. 

3)  Rotation  of  the  object  by  an  angle  9^  about  the  z-axis  of  the  object-centered 
coordinate  system  results  in  a  rotation  by  angle  0^  in  the  imaging  plane. 

4)  Rotation  by  the  angles  9^  and  6^  in  the  x  and  y  axis  of  the  object-centered 
coordinate  system  results  in  a  projection  of  the  3-dimensional  object  in 


the  imaging  plane 


The  first  three  transformations  are  the  same  transformations  which  have  been  shown 
not  to  effect  the  critical  point/feature  vector  formation  procedure.  Therefore,  only 
the  fourth  transformation  will  necessitate  a  modification  in  the  procedure 
previously  described  for  2-dimensioral  shape  recognition. 

The  necessary  modifications  will  result  in  an  expanded  shape  dictionary.  Now 
instead  of  one  set  of  critical  points  for  each  shape,  there  will  be  several  sets;  each 
representing  a  different  rotation  about  object-centered  axes  x  and  y.  Therefore  the 
following  procedure  is  suggested  for  assembling  the  3-dimensional  shape  dictionary. 


1)  With  the  object  at  its  minimum  distance  from  the  z’  plane  (maximum  scale 

factor)  obtain  a  projection  of  the  object  in  the  viewer  imaging  plane  with  0  and  6 

X  y 

at  zero  degrees. 

2)  Next,  obtain  the  outer  contour  of  the  projected  shape  and  extract  its  critical 
points  usir^  the  procedure  detailed  earlier.  Store  as  the  first  dictionary  page 
for  this  shape. 

0 

3)  Increment  9^  by  0.^  (  a  divisor  of  180  )  and  repeat  steps  L  and  3  to  obtain 

the  next  page  in  the  dictionary  for  this  shape.  Continue  incrementing  9^  until  a 
0 

rotation  of  180  has  been  reached. 

4)  Reset  0^  and  increment  9  by  0.^.  Keeping  0  fixed  at  the  new  value,  repeat 

0 

steps  1  thru  3  mtil  9^  has  been  incremented  to  180  . 


0  2 

The  shape  dictionary  will  now  contain  (  180  /  0.  )  views  of  the  3-dimensional 

inc 

0  0 

object.  The  other  views  ( for  0^  and  0^  between  180  and  360  )  are  mirror  images 
of  those  already  calculated,  and  may  be  extracted  from  the  existing  dictionary  by 
processing  the  critical  points  in  reverse  order. 
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The  choice  of  9.^^  directly  effects  the  accuracy  and  completeness  with  which  the 
3-dimensional  object  is  described  as  well  as  the  amount  of  memory  needed  '  as  3. 
is  reduced  the  accuracy  of  the  description  is  increased  along  with  the  amount  of 
memory  required  to  store  the  dictionary  ).  With  regard  to  the  accuracy  of  the  3- 
dimensional  shape  description,  an  unintended  but  benificial  side  effect  is  obtained 
from  the  extraction  of  the  critical  point’s  using  the  concavity  tree.  As  the  3- 
dimensional  object  is  rotated,  various  sections  of  the  object  become  visible.  The 
projections  to  these  sections  onto  the  imaging  plane  manifest  themselves  as  new 
concavities  in  the  corresponding  2-dimensional  silhouette.  Depending  on  the  direction 
of  rotation,  these  new  concavities  are  either  entering  or  leaving  the  concavity  tree. 
Since  this  is  the  essence  of  the  information  measured  by  the  confidence  number, 
additional  accuracy  is  gained.  This  is  especially  useful  when  the  actual  angle  of 
rotation  for  the  problem  text  is  not  a  multiple  of  6,^^. 

Critical  Point  Determination  for  the  Problem  Text 

When  the  boundary  of  an  unknown  shape  is  obtained  a  critical  point  list  (problem 
text)  must  be  configured  for  it.  The  procedure  by  which  this  is  done  is  quite  similar 
to  the  one  described  in  the  previous  section;  however,  no  confidence  number  is 
calculated.  This  is  due  to  the  fact  that  an  exact  value  of  the  scale  factor  a  is 
unknown.  As  a  consequence  only  those  portions  of  the  difference  sequence  partitioned 
by  the  thresholds  and  I2  l-o  determine  concavities.  These  concavities 

may  be  formed  in  three  ways  which  are  illustrated  in  figure  13,  parts  a,  b  and  c.  In 
the  first  case  the  and  V2  thresholds  are  both  crossed  in  forming  a  concavity;  for 
this  case,  the  critical  points  for  this  concavity  are  chosen  as  the  points  nearest  to 
threshold  inside  the  interval  [0,rj].  In  the  second  case  only  the  t~.  threshold  is 
crossed;  therefore,  the  nearest  points  to  T2  inside  the  interval  [0,r.,l  are  chosen  as 


i 
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critical.  In  the  last  case,  only  threshold  is  crossed;  therefore,  no  critical  points 
are  chosen  since  the  concavity  is  considered  too  small  to  register.  As  a  firai  result 
the  problem  text  contains  the  following  information  extracted  sequentially  f.-^m  the 
unknown  shape’s  boundary  for  each  critical  point:  1)  its  x  and  y  coordinate  values,  2) 
the  level  of  the  concavity  tree  at  which  it  was  detected. 


IV  COGNITIVE  STAGE 


Once  a  problem  text  has  been  defined  it  must  be  matched  against  all  the  dictionary 
pages  (shapes).  Since  the  algorithm  is  required  to  recognize  partial  shapes  as  a  part 
of  a  larger  shape,  features  must  be  formed  and  matched  accordingly.  The  basis  for 
comparison  is  the  shape  or  feature  vector  described  previously.  It  consists  of 
measurements  made  from  I  arbitrary  critical  points  selected  from  the  shape 
dictionary  or  problem  text.  As  a  first  step,  the  process  by  which  each  feature  vector 
is  selected  must  be  made  non-arfaitrary.  Four  general  conditions  are  suggested  for 
choosir^  the  individual  critical  points;  they  are  as  follows: 

1)  The  set  of  all  feature  vectors  should  give  near  equal  representation  to 
all  parts  of  the  shape. 

2)  The  total  number  of  feature  vectors  is  kept  as  small  as  possible. 

3)  Each  feature  vector  represents  a  feature  likely  to  appear  under  certain 
conditions  of  translation,  scale  or  rotation. 

4)  Each  feature  represents  a  localized  characteristic  of  the  shape. 

The  conditons  for  the  representative  feature  set  can  be  met  by  combining  the 
concavity  tree  information  with  the  confidence  number  information  contained  in  the 
shape  dictionary.  The  procedure  for  this  is  as  follows.  Let  each  pair  of  critical 


points  associated  with  a  given  concavity  act  as  seed  points  for  a  feature  vector. 
Choose  the  two  most  reliable  neighboring  critical  points  to  each  side  of  the  seed 
points  to  complete  the  sixtuple.  This  method  guarantees  at  least  one  feature  vector 
for  each  branch  of  the  concavity  tree.  Furthermore,  it  restricts  each  feature  to  three 
adjacent  concavities  for  a  given  scale  factor.  Finally,  a  linear  growth  rate  is 
achieved  -  if  there  are  N  critical  points  then  there  will  be  N/2  feature  vectors.  The 
method  for  selectir^  the  most  reliable,  neighboring  critical  points  is  described  next. 
Recall  that  the  confidence  number  not  only  indicates  the  percentage  of  scale  factors 
for  which  a  critical  point  is  detectable  but  also  the  order  in  which  they  are  detected. 
For  example,  if  and  represent  the  confidence  numbers  for  two  critical  point 
pairs  -  a  and  b  respectively,  and  then  if  pair  b  is  detected,  pair  a  should 

also  be  detected.  Therefore,  for  a  given  critical  point  pair  with  confidence  number 
choose  the  two  nearest  neighboring  critical  points  which  have  confidence  numbers 
greater  than  or  equal  to  When  considering  the  problem  text,  information 
concerning  scalir^  is  unavailable;  therefore,  the  simplest  scheme  possible  is 
derived,  i.e,  each  consecutive  grouping  of  six  critical  points  forms  a  feature  vector. 

d. 

Let  the  feature  vectors  from  the  shape  dictionary  be  denoted  by  where  k 

indicates  the  page  of  the  dictionary  and  m  the  mth  feature  vector  within  page  k. 

P 

Similarly  denote  the  nth  feature  vector  of  the  problem  text  by  ^Z. 

Step  1)  Associated  with  each  feature  vector  of  the  dictionary  and  the  problem  text 
is  a  sixtuple  called  the  level  code  vector.  Each  component  of  the  level  code  vector  is 
integer  valued  and  indicates  the  level  of  the  concavity  tree  at  which  the  corresponding 
critical  point  was  detected.  The  symbol  for  the  level  code  vector  associated  with 
feature  vector  ^Z  (^Z)  is  ^1  ( ^1).  For  feature  vectors  to  match  it  is  necessary 
that  their  level  code  vectors  also  match.  Therefore,  in  the  first  stage  of  the 
procedure  all  of  the  level  code  vectors  from  the  shape  dictionary  page  and  the 
problem  text  are  cross  matched.  The  distance  defined  by 
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is  the  basis  for  each  level  code  vector  comparison.  A  value  of  zero  for  d^(m,n)  is  a 
necessary  condition  for  two  feature  vectors  to  match.  This  distance  is  calculated  for 
all  m6M|^  and  all  n6N.  This  cross  referenced  set  is  referred  to  as  NXMj^.  Each  pair 
(m,n)  such  that  dj(m,n)=0,  is  added  to  the  reduced  feature  set  called  NXMj^.  Since 
dj(m,n)  requires  only  (2*1-1)  integer  operations,  compared  to  (21*1-2)  real  valued 
operations  for  a  complete  feature  vector  comparison,  computational  overhead  is 
significantly  reduced. 

Step  2)  For  each  pair  of  features  associated  with  the  elements  of  the  reduced  set 
NXMj^  calculate  a  pair  of  feature  vectors  using  the  definitions  methods  outlined  in 
chapter  III.  Next  evaluate  the  distance  dj(m,n)  defined  by 


29 


for  each  component  of  the  feature  vector  pair.  In  the  noiseless  case  the  distance 
d.(m,n)  is  zero  for  all  components  of  the  feature  vectors  m  and  n  if  they  match. 
Since  this  is  never  the  case  a  finite  positive  distance  must  be  allowed  when 
evaluating  (29).  Therefore,  if  d.(m,n)<r^  for  i  =  1  to  I,  then  the  feature  vectors 
are  said  to  match.  The  set  of  matching  feature  vector  pairs  from  this  stage  form  the 
new  set  NXMj^’.  It  should  be  mentioned  at  this  point  that  a  zero  value  for  all 

components  i  with  respect  to  the  distance  d.(m,n)  is  not  a  sufficient  condition  for  a 
match,  since  only  the  magnitude  information  from  each  component  is  considered  in 
(29).  On  the  other  hand  the  angle  information  is  examined  in  the  next  phase  of  the 
algorithm  and  errors  rarely  occur  due  to  this  in  the  final  evaluation. 

Step  3)  Encoded  within  each  group  of  matching  feature  vectors  is  information  about 
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the  scale  and  rotation  parameters  of  the  problem  text,  a  and  0  respectively.  If  a 
noiseless  case  is  assumed,  then  the  a  parameter  is  equal  to  the  ratio  of  the  two  size 
variables  and  the  9  parameter  is  equal  to  the  angle  between  the  feature  vectors. 

Since  perfect  knowledge  of  the  feature  vectors  is  never  obtained  the  values  of  a  and 
0  can  only  be  estimated.  The  eqciations  for  the  estimates  of  a  (a)  and  0  (0)  are 
given  by  (30)  and  (31)  respectively. 


These  estimates  are  obtained  for  each  pair  of  matching  feature  vectors  in  the  set 
Note  that  the  estimate  in  30  is  weighted  by  the  magnitude  of  the  individual 
components  so  that  large  errors  in  small  components  do  not  overly  bias  the  estimate. 

Step  4)  By  this  point  there  are  only  a  few  possible  matches  remaining,  the 
problem  now  is  to  pick  the  best  match  of  the  remaining  few,  or  to  reject  all  of  them 
if  they  are  false  matches.  Since  feature  vectors  from  the  same  shape  should  all  have 
the  same  scale  factor  and  angle  of  rotation,  clusters  in  the  estimates  of  a  and  0 
should  be  formed.  The  following  procedure  is  used  to  define  such  a  cluster.  Let  p  be 


the  index  on  set  NXM”  with  values  from  1  to  nm  and  form  the  vectors  [  a,  ,...,  a 
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,...,  ]‘  and  [  0j  0^  ,...,  0^  )  from  the  estimates  in  step  3.  Now 


calculate  the  two  circulanl  difference  matrices  T  and  Tn  and  normalize  them  bv  r 

a  u  '  c 

and  Tg  so  that 
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The  values  and  rg  represent  the  maximum  allowable  intra-cluster  distances  for 
the  estimates  of  parameters  a  and  9  respectively.  Next  calculate  Tq  the  total 


difference  matrix  between  all  estimates  as 


Td  =  V 


where  each  operation  is  performed  element  wise.  Tpj  will  be  an  nm  by  nm 

/~2  T 

difference  matrix  normalized  by  V  •  To  find  the  clusters  find  the  matrix 
formed  by  settir^  all  elements  of  Tq)  1  to  zero  and  all  others  to  1 .  Now  each  row 
(or  column)  contains  a  1  for  each  estimate  that  is  clustered  with  the  indexing 
estimate.  In  general  there  should  be  only  one  cluster  if  there  is  a  match;  however, 
the  above  procedure  may  also  produce  several  subclusters  of  the  larger  cluster.  In 
order  to  pick  the  best  cluster,  the  number  of  terms  and  the  strength  of  the 
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individual  matches  between  each  feature  vector  should  be  considered.  The  .‘"oilc.v'in? 


decision  scheme  is  chosen  to  select  the  final  cluster. 


1)  Total  the  number  of  members  in  each  cluster,  call  this  number  GCNT 


2)  Multiply  each  group  member  by  the  weighting  factor 


1  - 

i=l  ^ 


this  indicates  the  degree  of  match  between  each  feature  vector  in  the 


cluster.  Sum  this  number  and  call  it  GSUM 


3)  Add  the  value  r^«(GCNT  -  1)  to  GSUM  and  find  the  average  value  of  this 


total  for  each  group;  GAVG  =  (  r^*(GCNT  -  1)  +  GSUM) /GCNT 


4)  Pick  the  cluster  for  which  GAVG  is  maximum. 


To  gel  the  final  estimates  of  a  and  9,  find  the  average  of  the  individual  a.  and  0.  for 
the  chosen  cluster.  If  there  is  no  cluster  the  above  procedure  reduces  to  picking  the 


individual  feature  vector  with  the  highest  degree  of  match. 


Unlike  a  and  0  the  translation  of  a  whole  shape  is  not  equivalent  to  the  same 


translation  in  each  of  its  feature  vectors  (recall  that  translation  is  not  a  linear 


operation).  Therefore,  scale  and  rotation  information  is  required  to  eslim.ate  (the 
translation  vector  between  the  centroid  of  the  dictionary  shape  and  the  centroid  of 
the  problem  shape  which  have  been  matched).  To  estimate  z^^  the  matched  shape 
dictionary  is  scaled  and  rotated  with  respect  to  its  centroid  using  the  final  estimates 


obtained  for  a  and  0.  The  rotated  and  scaled  versions  of  the  dictionary  coordinates 


are  calculated  as 


m”’  =  "k  *  “  ■  "k  >  *  'mT  '  >'k 


my’  =  Vk*"  ( m^  -  yfc  '  ■ 


where  m  =  1  to  Mj^  and  (  Xj^  ,  )  is  the  centroid  of  the  dictionary.  Note  if  the 

dictionary  is  pi'epared  so  that  its  centroid  is  (0,0)  then  (35)  can  be  reduced  to 


^x’  =  a  cqs(B]  ( )  +  sin(0)  (^y 


y  =  a  |^cos(0)  (^y)  -sin(0)  (jjx  ) 


Since  translations  from  the  same  shape  should  be  equal  (except  for  certain  cases  of 
symmetry)  a  clustering  of  translation  parameters,  similar  to  that  just  described  for 
a  and  9,  is  expected  for  the  estimates  of  x^  and  y^^.  Therefore,  an  additional  step  can 
be  implemented  to  further  reduce  the  possibility  of  false  alarms  and  large  errors  in 
the  estimate  of  z^. 

Consider  the  final  cluster  from  which  a  and  0  were  obtained.  Calculate  the  values 
X,  and  y.,  the  translation  coordinates  between  each  matched  feature  vector  from  the 
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c  =  the  index  over  the  cluster  obtained  from  the  estimates  of  a  and  9 
{  c=l  to  C  ), 

i  =  the  index  on  the  individual  components  of  each  feature  vector  from  the 
cluster, 

x’=  the  X  coordinate  from  the  transformed  dictionary'  page, 

y’=  the  y  coordinate  from  the  transformed  dictionary  page. 

Next  form  the  two  vectors  [  ^x^ ^x^ j.x^  and  [  ^y^  ^y^  ^y^ 

from  the  values  obtained  from  (37).  ITien  calculate  the  two  circulanl  difference 

matrices  T  and  T  normalized  by  r  and  r  (the  maximum  allowable  intra-cluster 
X  y  ^  X  y 

distance  for  x^  and  y^ )  as 
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Next  calculate  the  total  normalized  translation  difference,  as 


V 
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where  each  operation  is  performed  element  wise.  can  be  checked  for  clustering  in 
the  same  manner  used  previously  with  a  and  0.  In  general  all  elements  of  are 
expected  to  cluster;  however,  there  is  a  small  probability  (  dependent  on  the  values 


of  r  and  r  )  that  they  will  not.  In  this  case,  the  non-clustering  terms  are  discarded 
X  y 

and  the  final  estimates  of  x^  and  y^^  are  obtained  by  averaging  over  the  remaining 
cluster  elements. 


V  RESULTS 


A  3-dimensional  shape  dictionary  was  constructed.  It  consisted  of  two  airplanes, 
an  F-15  and  an  F-4.  A  value  of  15  degrees  was  chosen  for  0.^  and  a  total  of  20 
views  generated  for  each  plane.  The  rar^e  of  views  covered  ±30  degrees  pitch  and  0 
to  -45  degrees  roll.  The  boundary  curves  were  obtained  by  thresholding  a  512X512 
pixel  image  of  each  view  to  obtain  a  binary  representation.  A  contour  was  then 
obtained  from  this  image  and  processed  for  critical  points.  The  contours  obtained  for 
each  plane  are  shown  in  figure  14. 

A  value  of  13  was  chosen  for  the  window  length  of  the  averaging  filter  used  in  the 
first  stage  of  the  critical  point  extraction  routine.  The  values  used  for  the 
segmenting  thresholds  were  rp-.3  and  x-y=-i.2.  The  vakes  of  the  thresholds  used 
to  determine  the  confidence  numbers  were  i’3='2.4  and  r^=-4.8  respectively.  This 
corresponds  to  a  design  range  of  .5  to  1  for  the  scale  factor  a.  A  bias  value  of 
equal  to  .48  was  calculated  based  on  a  maximum  deviation  due  to  grid  rotation  of 
(V?-l)'*r-,.  A  maximum  normalized  distance  of  r  =.04  is  allowed  between  feature 
vector  components.  Maximum  cluster  distances  of  r^=±.075  and  degrees  were 
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selected  for  the  scale,  rotation  cluster  portion  of  the  algorithm.  Translation 

parameter  clustering  was  implemented  with  a  maximum  scattering  of  ±10  pixels  for 

both  r  and  r  .  A  cluster  weight  of  r  =.l  was  chosen  for  the  cluster  decision 
X  y  °  w 

procedure. 

The  problem  texts  were  then  extracted  from  the  scene  shown  in  figure  15.  This 
scene  contains  three  planes,  an  F15,  an  F-4  and  a  HARRIER,  placed  at  arbitrary 
orientations  within  the  range  covered  by  the  shape  dictionary.  The  HARRIER  was 
used  as  a  control  to  check  the  algorithm’s  ability  to  discriminate  non-dictionary 
shapes  from  dictionary  shapes.  The  video  camera  has  been  moved  away  from  the 
planes  to  provide  a  scale  change  and  adverse  lighting  has  been  employed  to  add 
uncertainty  in  the  determination  of  the  planes’  boundaries.  The  three  problem  texts 
shown  in  figure  16  were  extracted  from  the  scene  using  binary  thresholding.  They 
were  then  matched  with  the  shape  dictionary.  From  figure  16  it  can  be  seen  that 
only  partial  shape  information  is  recovered  for  each  plane  (  the  shadows  are 
interpretted  as  part  of  the  object  information ). 

Each  problem  text  was  cross-matched  with  the  two  objects  in  the  shape  dictionary. 
The  results  of  the  match  are  demonstrated  in  figure  17.  The  white  contours 
superimposed  over  the  original  scene  indicate  the  view  from  the  dictionary  matched 
to  each  problem  text  with  its  estimated  scale,  rotation  and  translation  parameters. 
The  first  problem  text  is  identified  as  an  F- 15  at  -15  degrees  pitch  and  -15  degrees 
roll.  The  second  problem  text  was  identified  as  an  F-4  at  0  degrees  pitch  and  -30 
degrees  roll.  The  third  problem  text  was  not  recognized  as  a  member  of  the  shape 
dictionary.  The  procedure  generated  no  false  alarms. 

VI  DISCUSSION 

If  additional  identification  power  is  required  a  second  stage  can  be  implememted  to 
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eliminate  portions  of  any  composite  shape  problem  text  as  they  are  identified, 
thereby  reducing  the  composite  problem  text  to  a  truncated  problem  text  \vhich  has  a 
higher  recognition  rate.  Also  false  alarm  rates  can  be  significantly  reduced  by 
implementing  a  similar  procedure  to  compare  boundary  points  once  a  possible  match 
has  been  indicated.  It  is  also  possible  to  extend  the  feature  set  chosen  for  each 
dictionary  shape  thus  reducing  the  probability  of  a  miss.  Unfortunately  this  results  in 
an  increase  in  match  times  required  for  each  dictionary  shape.  This  increase  in  time 
will  be  linearly  related  to  the  increase  in  the  feature  vector  set.  Miss  rates 
may  also  be  reduced  by  relaxing  the  maximum  distance  allowed  between  feature 
vector  components,  This  is  recommended  only  if  the  previously  discussed 
verification  procedure  is  implememted  because  relaxation  also  increases  the  rate  of 
false  alarms. 

If  a  sequence  of  dynamic  images  (  scenes  with  objects  in  motion  )  is  being 
analyzed,  it  is  believed  that  the  overall  recognition  rates  will  improve  over  the 
entire  course  of  the  sequence.  This  is  because  the  amount  of  occlusion  will  not 
remain  constant  during  the  entire  sequence,  thus  revealing  more  object  information 
in  the  totality  of  the  sequence  than  in  any  one  frame. 

Vll  CONaUSlONS 

The  definition  of  critical  points  using  the  concavity  tree  structure  and  the 
corresponding  identification  algorithm  described  in  this  paper  have  been  shown  to 
offer  the  following  features  and  advantages  when  the  identification  of  3-dimensional 
shapes  in  2-dimensional  scenes  is  considered.  The  theoretical  definition  of  critical 
points  using  the  concavity  tree  structure  of  a  shape  is  invariant  to  transforms  of 
scale,  rotation  and  translation  within  the  2-dimensional  scene.  In  the  actual 
implementation,  invariance  to  the  latter  two  transforms  is  maintained  and 


invariance  over  a  specific  range  of  scale  factors  is  obtained.  The  critical  point 
definition  is  reasonably  stable  under  conditions  of  shape  occlusion  and  partial 
framing,  such  that  partial  shape  information  is  usually  suficient  to  identify  a  shape. 
The  algorithm  requires  only  one  pass  per  concavity  tree  branch  to  extract  the 
critical  points.  These  critical  points  are  the  same  regardless  of  starting  point.  The 
critical  points  are  concentrated  in  areas  of  high  shape  dynamics,  i.  e.  the  highest 
areas  of  shape  quantification  occur  in  those  regions  of  the  shape  of  highest 
qualitative  interest.  The  algorithm  is  considered  robust  in  the  sense  that  ,  the  more 
complex  a  shape  becomes  the  higher  the  probability  of  identifying  it  in  a  scene.  This 
is  due  to  the  fact  that  more  critical  points  are  allocated  to  the  more  complex 
shapes.  The  algorithm  is  flexible  in  the  sense  that  it  can  be  modified  to  accomodate 
restrictions  on  memory  or  computation,  also  as  a  byproduct  of  the  matching 
procedure,  estimates  of  the  scale,  rotation  and  translation  parameters  of  an 
identified  shape  are  automatically  generated. 


APPENDIX 


The  proof  of  the  theorem  in  section  III  follows.  It  has  already  been  established 
that  the  operation  is  a  point  set  relationship  between  a  shape  (subshape)  and  its 
concavity  tree,  or  to  be  more  exact  a  shape  (subshape)  and  its  convex  hull.  Therefore 
it  is  sufficient  to  show  that  this  relationship  is  maintained  under  the  three 
transformations.  The  convex  hull  S  of  a  shape  S  is  defined  mathematically  as  the 
set  of  all  points  r  such  that 


r  =  Xp+(l-X)q;  0<X<1  al 

where  p,q  are  points  in  S.  This  includes  all  points  on  all  line  segments  joining  every 

2 

pair  of  points  in  S.  For  points  r,  p,  q,  in  R  (al)  may  be  rewritten  as 

(Px.Py)  +  (1-^)  (VV  ’ 

where  r  =Xp  +(1-X)q  ;  r  =Xp  ^(l-Xlq  ;  0<X<1.  The  effect  of  each  transformation  is 

XX  ^  y  y  y 

to  map  individ'jal  points  of  S  into  S’,  therefore  the  definition  of  S  is  simply  the 
above  definition  with  p,q  replaced  by  p’,q’.  Consider  first  the  convex  hull  of  a  shape 
under  the  transformation  S  (•).  It  becomes  the  set  of  all  points  r’  such  that 

r’  =  Xp’  + (j-X)  q’ ;  0<X<1 

or 

(rx’-r-y’)  =  X  (  ap^  ,  ap^  )  +  (1-X)  (  aq^  ,  aqy  )  ,  a3 

where 

«  Px  +  a  qx  ; 

r  ’  =  X  a  p  +  (1-X)  a  q  .  ad 

y  Py  Hy 

the  terms  in  (a4)  can  be  rearranged  so  that 


=  Of  [  A  +  (l-X)  ]  ; 

which  is  equivalent  to  the  transformation  S^(»)  on  the  whole  line  segment,  or 

f 

r’  =  ■  X  p  +  il-X)  q  ■ .  a6 

Taken  over  the  entire  set  S,  this  can  be  rewritten  as 

C^{  ( S ))  =  (  C^(  S  ))  ,  3? 

which  can  be  interpretted  as  "the  convex  hull  of  the  scaled  shape  S  is  equal  to  the 
scaled  convex  hull  of  S.  A  similar  argument  can  be  used  for  the  transformations 
Rq{*)  and  T{»]  with  (a4),  (a5)  and  (a6)  replaced  by  (ad),  [a9),  (alD)  and  (all), 
(al2),  (al3)  forRg(*)  and  T(*)  respectively.  Specifically, 


where. 


(  r  r  ’ )  =  X  (  p  COS0  -  p  sin0  ,  p  sin0  +  p  cos0  )  + 

X  y  X  y  X  y 

(l-X)  (q^  COS0  -  q^  sin0  ,  q^  sin0  +  q^  cos0  ) 
r  ’  =  X  (  p  COS0  -  p  sin0  )  +  (l-X)  (  q  cos0  -  q^  sin0  )  ; 

X  X  y  X  y 

-  --ry-=-X-(-p^  cos@-+  Py  sin0-)  t(l-X)-(-q^  sinS-  -  q,^  co50  ) 


a8 


which  can  be  rewritten  as, 

Tx’  =  [  X  Px  +  (l-X)  q^  ]  COS0  -  [  X  Py  +  (l-X)  Py  ]  sin0  ; 

ry’  =  [  X  p^  +  (l-X)  q^  ]  sin0  +  [  X  Py  +  (l-X)  Py  ]  cos0  a9 

and  finally. 


■  X  p  +  (l-X)  q 


alO 
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Similarly, 


which  can  be  rewritten  as, 

^x’  ~  ^  Px  ^  ^  \  ^  ^  ^  ‘  ^  \  ~  f  ^  Px  ^  ^x  ^  ^  ’^t 

f'y’  =  ^  Py  +  ^  yi  +  qy  -  ^  qy  +  yt  +  ^  yt  =  t  ^  Py  +  py  J  +  yt 

al2 

and  finally, 

r’  =  Tfxp+(l-X)q|.  al3 

; 

These  arguments  can  be  summed  up,  as  in  (a?)  for  the  scaling  transform,  by 
(a  14)  and  (a  15)  for  the  rotation  and  translation  transforms  respectively. 

C^{Kq(S))=Rq{C^(S))  .  al4 

C^(TiS}}  =  T(C^(S)]  .  al5 

It  has  been  shown  that’ the  relationship  between  the  "convex  hull  of  a  shape  with  the 
shape  itself  is  preserved  under  tire  defined  transformations.  The  only  other 
operations  involved  with  the  determination  of  C  from  S  are  intersections.  However, 
since  S^(*),  Rq(*)  and  TW  are  restricted  to  be  one-to-one,  all  intersections  are 
preserved  by  the  nature  of  the  transforms,  i.e.,  any  two  points  which  coincide  in  the 
image  of  the  transforms  will  also  coincide  in  the  preimage  of  transforms.  Therefore 
the  relationships  in  (24)  are  true. 
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Figure  1  -  Concavi'v  Tree  for  ArbUrarv  Shape  S 
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Figure  4  -  Concavity  Tree  for  Partial  Shape  S' 


Figu^  7  -  Flow  Diagram  for  Critical  Point  Extraction  Frocadnr' 
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igure  12  -  3-Dimensional  Object  Orientation 
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igure  13  -  Types  of  Concavities 
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PART  TWO 


A  Minimax  Risk  Quantizer  for  Noisy  Sources 


I .  Intro duct  ion 

Classical  quantizer  design  theory  has  focused  on 
determining  the  optimal  quantization  mapping  which 
minimizes  the  expected  value  of  a  distortion  measure 
between  the  quantizer  input  and  output.  These  techniques 
usually  rcjquire  a  priori  information  concerning  the 
statistics  (probability  distribution,  1st  and  2nd  moments, 
etc.)  of  the  signal  to  be  quantized.  The  need  for 
quantizing  sources  with  incomplete  knowledge  of  the  source 
statistics  is  well  recognized,  and  a  number  of  efforts 
[1)[6][12)  have  been  made  to  address  the  problem.  Bath 
and  VandeLinde  [1]  have  presented  the  fundamental 
development  for  minimax  quantization  of  signals  with 
distributions  from  a  unimodal  generalized  moment 
constrained  class.  The  minimax  quantizer  is  said  to  be 
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that  quantizer  with  a  maximum  distortion  between  tlie 
quantizer  input  anti  output  less  than  that  of  any  oLIkm' 
quantizer  for  the  given  distribution  class. 

Minimum  risk  quantization  theory  [4J  has  been  applied 
to  the  problem  of  quantizing  a  source  signal  corrupted  by 
independent  additive  noise  (figure  1)  and  has  shown  the 
promise  of  using  risk  theory  to  determine  an  optimal 
quantization  scheme.  This  approach  also  uses  considerable 
a  priori  information  concerning  the  source  and  noise.  The 
general  risk  theory  analysis  however,  allows  the  use  of  a 
number  of  estimation  schemes  to  solve  the  corruptive  noise 
problem.  Minimax  estimation  can  be  applied  to  the 
quantization  problem  to  produce  a  quantizer  which 
guarantees  a  maximum  risk  less  than  that  of  any  other 
quantizer.  In  section  II  the  general  quantization  problem 
is  stated.  Section  III  follows  with  a  description  of  the 
risk  theory  approach  to  quantization.  Section  IV  develops 
the  minimax  risk  quantization  theory  for  an  additive  noise 
corrupted  source  with  parallels  drawn  to  the  foundation 
provided  by  Bath  and  VandeLinde.  Section  V  describes  some 
simulation  results  followed  by  conclusions. 

1 1 .  General  Quantization 

Most  of  the  prior  work  in  quantizer  design 


charac  I,  er  i  z  e  the  quantizer  mapping  as  an  estimator  of  the 
quantizer  input.  A  decision  is  made  regarding'  w  i  I  ii  i  n 
which  region  of  the  domain  an  input  sample  falls,  and  a 
representative  value  is  determined  which  best  estimates 
elements  of  that  region.  Denote  the  quantizer  domain 
space  X,  and  the  finite  range  space  T.  Then  the  quantizer 
mapping,  y  -  q  (  a")  ,  produces  y  6  I,  'i  x  e  X.  The  design  c  f 
a  quantizer  consists  of  completely  specifying  qtA")  and  I, 
when  X  is  known.  The  mapping  is  surjective  since  every 
element  in  T  is  a  possible  outcome  of  q(A'),  x  6  X.  Since 
1  is  finite,  call  N  the  number  of  elements  in  Z.  Define 
an  index  set  on  the  elements  of  Z  so  that 

The  domain  can  be  partitioned  into  N  disjoint  sets, 

-i'j  ={■'(■  I  q('V)  =  .Kj'>  A-  e  X},  i-l,***,N,  with  U  ^  X. 

As  mentioned  above,  it  is  common  practice  to  require 
that  a  representative  value  y  estimate  the  sample  .y.  To 
this  end,  Z  and  q(Ar)  are  defined  so  that  y  closely 
approximates  x.  This  will  require  that  each  subset  be 

a  connected  subset  with  y^  €  .  The  use  of  risk  theory 

for  quantization  analysis  will  describe  the  quantization 
process  in  such  a  fashion  that  the  membership  requirement, 
y^  6  Xp  may  no  longer  be  applicable  and  will  not  be 


r equ i r ed . 


f I  1 .  Kisk  Quantization 

The  problem  of  quantizing  noisy  sources,  as 
illustrated  in  figure  1,  requires  new  techniques  to 
determine  optimal  quantization  schemes.  Here  an 
indepentlent  identically  distributed  (i.i.d.)  information 
source  has  been  corrupted  by  an  i.i.d.  additive  noise 
source,  and  it  is  required  to  quantize  the  resulting  noisy 
signal.  The  classical  quantizer  design  procedure  yields 
the  quantizer  mapping  which  produces  range  elements  that 
best  estimate  the  noisy  input  samples  which  may  produce 
unsatisfactory  results.  The  problem  is  caused  by  the 
perception  of  the  quantizer  as  an  input  estimator.  By 
changing  this  perception,  it  will  be  shown  that  the 
problem  may  be  solved  using  risk  and  estimation  theory. 

/?isk  quantization  is  based  upon  the  state!  space 
representation  of  figure  2.  Here  the  source,  9,  is  mapped 
through  some  probabilistic  transition  mechanism, 

P^.|q(,v|0),  from  the  parameter  space  to  an  observation 

space  Z.  The  quantizer  mapping  x)  is  the  decision  rule 
which  determines  a  representative  element  y  from  the 
decision  apace  T  which  estimates  the  source,  9,  based  upon 
the  observation  x  e  Z.  Since  the  quantizer  maps  onto  the 
decision  space  1,  and  the  <quanlizer  range  must  consist  of 
a  finite  number  of  elements,  the  elements  of  I  may  be 


indexed  so  that  y  ^ and  (J  y-  =  1  as  he  fore. 

A  1  t  li  o  u  K  li  the  observation  x  is  an  element  of  the  set  .by,  y  y 

representing  the  observation  set  will  not  be  required  to 
be  an  element  of  that  set.  This  is  because  the 
probabilistic  transition  (due  to  the  noise)  might  corrupt 
tiie  original  signal  source  such  that,  the  best  estimate 
does  not  lie  within  the  observation  set  d'y.  The  concept 

of  quantization  as  estimation  is  the  key  to  risk 
quantization,  with  the  specific  requirement  of  source 
estimation  rather  than  input  estimation.  This  will 
provide  the  necessary  quantization  schemes  for  noise 
corrupted  systems,  while  incorporating  the  classical 
quantizer  definition  of  input  estimation  for  the  noiseless 
case. 

To  determine  an  estimation  rule,  some  form  of  quality 
measure  is  required.  This  may  be  given  by  a  cost  function 
C(0,y)  over  @  x  X  which  represents  the  cost  associated 
with  estimating  the  parameter  0  by  decision  y,  based  on 
observation  x,  V  0  €  ©,  a"  e  X,  y  6  I.  The  cost  is 
generally  expressed  as  a  function  of  the  difference 
between  the  parameter  and  its  estimate, 

C(0,y)=C(e-0-y)  (1) 

With  the  specified  cost  function,  the  risk  is  defined  as 
the  expected  value  of  the  cost: 


R  =  Kfc(  e.  v) )  ]  (  2 

Ksl  ininl  lot)  Ih'Mit  v  lerhtiiques  may  1) e  employed  to  determine 

the  quantisation  mapping  q{.v)  which  minimizes  the  risk 
f  u  n  <:  I  ion. 

I  V  .  Miiiimax  Risk  Qiiantization 


The  risk  theory  approach  has  been  applied  to  the 
additive  noise  problem  depicted  in  figure  1  for  well 
defined  a  priori  source  and  noise  distributions  to  obtain 
a  minimum  risk  quantization  solution  [4],  It  is  shown  in 
i4!  that  the  risk  approach  is  a  viable  one  to  quantizer 
design  for  such  systems,  and  is  equivalent  to  the 
classical  result  for  noiseless  systems. 

It  IS  often  likely,  however,  that  the  quantizer 
liesign  must  be  based  on  incomplete  knowleilge  of  the  signal 
statistics.  for  example,  a  problem  may  arise  when  a 
quantizer  is  needed  for  a  source  where  the  statistics  are 
fairly  well  known,  but  various  unknown  noise  d  i  s  1.  r  i  b  ii  t  i  o  n  s 
can  corrupt  the  source  signal.  Likewise,  the  noisy 
characteristics  of  a  sensor  may  be  well  defined,  but  the 
exact  probability  distribution  of  the  source  signal  to  be 
quant  ized  with  the  sensor  unknown.  These  types  of 


problems  suggest  the  use  of  minimax  estimation  to 


determine  the  quantization  mapping.  Minimax  estimation 
seeks  to  minimize  the  maximum  possible  distortion  by  the 
estimator,  and  is  directly  applicable  to  the  risk  approach 
to  quantizer  design. 

This  is  demonstrated  by  examining  the  problem  of  an 
additive  noise  source  using  risk  theory  and  minimax 
estimation.  Specifically,  the  source  statistics  and 
probability  distribution  will  be  assumed  known  a  priori. 
The  exact  distribution  of  the  noise,  however,  will  be 
assumed  to  be  unknown,  but  belonging  to  a  generalized 
moment  constrained  class  of  distributions,  C,  A  risk 
analysis  of  the  system  will  be  applied  using  minimax 
estimation  to  produce  a  minimax  risk  quantization  scheme 
for  the  system.  Further,  it  will  be  shown  that  the 
analysis  is  reversible  and  may  be  applied  to  the  additive 
noise  problem  when  the  noise  probability  distribution  is 
well  defined,  but  the  knowledge  of  the  source  distribution 
is  incomplete. 

•jje 

A  minimax  estimator  t  is  one  which  guarantees  a 
maximum  risk  no  greater  than  that  for  any  other 
estimator  [11].  That  is, 

supR*(0)  <  supJ?(0)  (d) 

0  t’  0  t 

Likewise,  the  minimax  quantizer  will  be  defined  as  one 
which  guarantees  the  minimum  maximal  risk  for  all 
quantizers.  Miniraax  quantization  theory  has  been 
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developed  for  the  classical  quantizer  problem  of  noiseless 
sources  by  Hath  and  VandeLinde  [ IJ.  The  approach 
developed  will  be  applied  in  the  risk  theory  development 
of  an  appropriate  quantizer  for  the  additive  noise 
prob lem . 

Known  Source  Distribution 


Consider  the  sequence  of  independent,  iflentically 
distributed  random  variables  (  >  from  the  known 

cumulative  probability  distribution  function  (c.p.d.f)  Fg. 


The  source  sequence  has  been  corrupted  by  a  noise  sequence 
with  unknown  c.p.d.f.s  {F^  ),  to  produce  the 

observation  sequence  {Xj^}  to  the  quantizer. 

The  sequence  {F^  },  while  unspecified,  are 


constrained  to  belong  to  the  set  C  of  all  possible 
c.p.d.f.s  with  whatever  a  priori  information  of  the  noise 
characteristics  is  available.  The  set  to  be  considered 
here  is  the  set  of  c.p.d.f.s  belonging  to  the  generalized 
moment  constrained  class.  These  distributions  are 
required  to  have  a  generalized  moment  less  than  or  equal 
to  some  finite  constant.  This  may  be  viewed  as  a 


restriction  of  the  noise  power  to  be  finite. 


The  mlnimax  risk  is  defined  as; 


R  -  min  max  q .  f  o .  !■' a/)  > ) 

where  R(q,Fg,Fyy^)  is  the  risk  for  an  N-level  quantizer  q 
from  the  set  of  all  possible  N-level  quantizers  (3j|j  and  the 

c.p.d.f.s  Fq,  described  above.  The  minimax  risk 

quantizer,  q  ,  is  that  quantizer  which  provides  the 

♦ 

minimax  risk  R  .  The  properties  of  the  minimax  risk 
quantizer,  analogous  to  the  minimax  quantizer,  are: 

1)  V  F^6  C,  a  R^*  >  R* 

where  R  is  the  absolute  maximal  risk  for 
quantizer  q*. 

2)  a  q  6  (5  ,  q  5^  q*  3  max  R(q,Fg,FJ  <  R* 

"  F^eC  ®  ^ 

where  q  guarantees  a  maximum  risk  no 
greater  than  the  maximum  risk 
of  any  other  quant  izer  q  e  0^^ . 

The  quantizer  mapping,  q(A-)  6  ,  maps  an  .y  g  to  a 

quantizer  level  G  T,  i=l,*'*,N.  For  the  scalar 

quantizer,  ^  i )  ,  where  =  -<*>,  and 

the  quantizer  thresholds  comprise  the  set  {.Vy}, 
i=l,‘*',N+l.  Note  that  the  set  of  quantizers  (3J^J  does  not 
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require  the  quantizer  level  y-  be  an  element  of  X  ■. 

The  risk  for  any  particular  quantizer  mapping  may  be 
(lefineci  as: 

RCq.Fg.F^)  =  J  J  C(e,q(x))  dF^p^te)  de  (5) 

e  N 

where  the  inner  integral  is  a  Lehes gue -S t i e 1 t j es  integral 
over  l;he  sequence  of  noise  c.p.d.f.s  {F*,  }.  The  order  of 

integration  is  changed  to  yield: 

R(q,Fg,F^)  =  J  J  Cle.qlA-))  PgO)  de  dF^  (6) 

JV  © 

The  purpose  of  the  interchange  is  to  facilitate  the 
minimax  operation  procedure  that  occurs  later. 

It  is  apparent  that  if  the  system  is  noiseless,  the 
risk  function  is  precisely  that  of  the  classical  quantizer 
distortion  function.  The  minimax  quantizer  then  is  the 
same  as  the  results  of  Max  [9]  since  the  source 
distribution  is  assumed  known. 

=  inf  J  C(0,q(x))  Pq(9)  d0,  9  -  y 

R*  =  inf  f  C(0,q(0))  Pa(9)  de  (7) 

X  8  * 

The  cost  function  used  is  restricted  to  some  bounded 
distortion  measure,  d(e,q(.Y)),  between  the  source  atid 


quantizer  output: 


C  f  0  ,  q  (  .V  )  ) 


d  (  0  ,  q  (  ,Y )  )  <  r, 
d(0,q(,Y))  ;•  L 


(  «  ' 


Tlie  dislorl.ion  m«iasure  d(0,q(A’))  is  required  to  be  even, 
continuous,  raonotonic  strictly  increasing  in  |0-q(,Y)|,  and 
zero  for  perfect  estimation.  The  bound  on  the  cost 
implies  no  additional  penalty  for  a  distortion  larger  than 
the  limit,  L. 

The  generalized  moment  constraint  is  given  by  some 
function  pin)  satisfying  the  same  conditions  as  those  for 
the  distortion  function  such  that; 


J  p( n)  dF{n)  <  c . 
N 


(9) 


The  constraint  function  pin)  simply  implies  that  the  noise 
signal  power  is  bounded.  The  distortion  and  constraint 
functions  considered  here  are: 


d  i  0,  <7(  )  =  I  0-q(  x)  I  " 


(10) 


pi  n) =  \  n\ 


m  ^n 


The  conditions  of  bounded  cost  and  those  imposed  on 
d(0,q(A’))  and  pin)  are  required  for  the  Lagrange 
minimization  of  the  next  section. 


The  problem  then,  is  to  determine  the  minimax  risk 


quantizer  q*  for  the  worst  case  c  .  p  .  d  .  f  .  ,  F  w  h  i  c  h 


achieves 


73 


rmrwj^  wjmj' 


inf  sup  R(<i.Fg,F^)  ^  R(q*,FQ,F^).  (11) 

Th(?  set.  (3|yj  is  t.  he  set  of  all  symmetric  quant  izt!ts  where 

the  levels  are  symmetric  about  the  source  mean,  and  the 
thresholds  are  symmetric  about  the  observation  mean.  The 
set  C  is  the  set:  of  c.p.d.f.s  belonging  to  the  generalized 
moment  class,  and  R  is  the  risk  function  using  the  cost 
function  C(B,q{x)).  To  determine  the  minimax  risk 
quantizer,  it.  first  necessary  to  find  the  maximum  risk  due 
to  a  particular  quantizer  mapping  q  G  (3^^ ,  then  determine 

which  quantizer  provides  the  minimum  of  all  of  the  maximum 
risks. 

It  has  been  shown  [1)  that  the  set  C  is  a  weak 
compact  subset  of  a  normed  Banach  vector  space  (N.B.V.)  of 
normalized  functions  of  bounded  variation.  Also  with  any 
linear  functional  on  N.B.V.  [O,®®]  (such  as  R(q,Fg,  Fyy)  )  , 

which  is  weak*  continuous  in  F^,  then: 


V  q  €  (3^,  3  F*  €  C,  R*(q)  3: 

R*(q)  =  RCq.FgPjl^)  =  max  R(q,F0,F^) 

F^eC 


(  121 


Furthermore,  a  method  for  determining  a  minimax  quantizer 
via  a  constrained  minimization  in  a  Lagrange  multiplier 


space  (R*)  through  the  use  of  the  Lagrange  duality  theorem 
has  been  developed  [1].  The  technique  will  be 
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paraphrased  here  to  determine  a  minimax  risk  quantizer. 

Let  C  =  ^^'yV  ^  N.B.V.[0,<“]}  wliere  exhibits  ttie 

following  properties; 


1)  Fyy  is  nonnegative,  monot on i cal  1 y  nondecreasing, 

21  I  dF^  <  I, 

ff 

2)  Jp(/7)F^(/j)<c. 

N 


Define  the 

convex  set  C  - 

{  F^  6  N.B. 

V .  [ 0 , »]  }  ,  where  F  ^ 

satisfies 

1)  . 

Now  define 

a  convex  functional 

G  :  N  .  B  . 

V  .  [  0  ,  «]  — * 

G  ^ 

.  <J 

f  -  1  ' 

f  " 

which  represents  the  necessary  constraints  for  to  be  a 

c.p.d.f.  and  have  a  generalized  moment  constraint.  This 
i mp 1  i  es ; 

c’  =  {  €  C;  G(F^)  <  0  ) 

By  the  Lagrange  duality  theorem  [8],  an  expression 
for  the  maximum  risk  due  to  the  quantization  mapping  q(.v) 


may  be  found: 


R  (  q  )  =  aup  R(  q  ,  Fg,  F^) 


sup  R(q,Fg,F^) 

F^6c’ 


sup  ^  R(  q  ,  Fg,  F 


\  [|  -‘''a-  '  l]  ^  "2  [J  " 


')  } 


where  the  inner  maximization  is  achieved  for  some  worst 

— 

case  distribution,  and  the  outer  minimization  by 

and  X^.  If  the  cost  weighting  function  is  bounded, 

R*(q)  is  finite  and  equation  (6)  can  be  used  to  replace 
^(Q.Fg.Fy^)  of  equation  (13)  to  yield: 


R{q)  =  min  [(X  +  Xc)  -1  B(X  X  ) 

Xj,X2>0  ‘  ^  = 


where 


B(X^,X^)  =  max  J  I  {  n)  dF^(/;) 


Fyy^eC  JV 


I  {  n)  =  J  C(0  -  qlxf))  pg(0)  d0  -  X^  -  \^f>{  ti)  (IB) 


The  minimization  of  equation  (14)  need  only  be 
considered  when  B(X^,X^)  is  finite.  This  will  only  occur 

when  I  {  n)  is  nonpositive.  If  !(/?)  is  positive  at  any 
point,  a  sequence  of  noise  random  variables  !  with 

c.p.d.f.s  iF;i,  }  could  occur  such  that  as  k  gets  large,  th 


^5^ 


t  * 


integration  tends  to  infinity. 


If  !(/?)  IS  r(.'<)uiriMl  to  be  nun  positive,  then  thi' 
maximization  of  equation  (15)  over  all  possible  G  C 

imp  lies: 

B( X^)  =  0.  (17) 


Since  Fyy  *  constant  is  an  element  of  C  for  it 

follows  that  if  I {  n)  is  made  nonpositive,  the  maximum 
possible  integration  of  equation  (15)  is  zero. 

The  minimax  risk  for  any  quantizer  Q  ^*3^^  and  the 


specified  conditions  then  reduces  to; 


R*(q) 


min  (  X  . 

,  X  >0  ^ 

1  ’  a 

I(/7)  <0 


X^c) 


(IB) 


Figures  3  through  6  are  a  series  of  curves  which 
depict  the  function  I  (  ;?)  for  various  quantizers  with 
Pg(d)  ~  N(0,l),  and  distortion  bounds  of  L  =  0.25,  0.75. 


These  curves  show  the  effects  of  the  distortion  bound  on 
the  function,  and  that  it  is  quantizer  dependent.  Note 
the  discontinuity  near  the  quantizer  threshold. 

The  minimax  risk  quantizer  q*  then,  is  that  quantizer 
which  produces; 

R  =  min  min  (  x  +  \  c '  i  1 9 

qe<3N  ' 

I(/7)<0 

This  result  is  similar  to  that  obtained  for  minim ax 
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quantizal  ion  by  Bath  ami  VancieL  inde  for  the  general  iy:ed 


moment  c o n s  t  r a  i  n ed  class  of  quantizer'  input  distributions. 
It  differs  primarily  in  the  function  li/i)  wliich 
incorporates  the  cost  of  estimating  the  source  try  the 
quantization  mapping  and  the  a  priori  knowledge  of  the 
clean  source  probability  distribution  function,  F^Ce). 

The  minimization  of  equation  ( 14 t  over  all 
nonnegative  X^,  X^,  and  nonpositive  I ( a)  implies  the 

maximum  of  1(a)  is  zero.  As  can  be  seen  in  equation 
(16),  the  role  played  by  X^  in  1(a)  is  merely  that  of  a 

bias  term.  This  implies  that  the  minimization  of  equation 
(14)  can  be  determined  by  solving: 


X* ( X  )  =  max 

^  ^  n 


J  C(0,q(x))  Pq(0)  d0  -  X^p(a) 
© 


(20) 


and  minimizing  over  all  possible  X^  >  0.  Maximization  of 


1(a)  with  respect  to  X^  and  ^  specific 

quantizer  q  €  will  then  yield  the  minimum  X^  and  the 
requisite  X^  for  max  1(a)  =  0. 


Known  Noise  Distribution 


The  minimax  risk  theory  is  easily  modifiird  for  the 
converse  problem.  That  is,  the  prblem  considered  thus  fat 


is  that  of  quantizing^  an  i.i.d.  source,  with  known 
probability  distribution,  corrupted  by  a  scquenci;  of 
independent  additive  noise  random  variables  each  of 
uncertain  distribution.  If  the  noise  sequence  is  i.i.d 
with  known  c.p.d.f.  F^,  but  the  source  sequence  f  is 

only  known  to  be  distributed  with  the  c.p.d.f.  sequence 
(F^  \  from  the  set  C,  then  the  risk  may  be  written: 

R(q,Fg,F^)  =  J  J  C(0,q(  -'■)  )  P^(  d/?  dFg  ( 

6  N 

The  minimax  risk  quantizer  development  for  this 
problem  parallels  the  development  for  the  previous  case 
A  similar  solution  is  described  as  the  minimax  risk 

quantizer  q*  which  yields  the  minimum  maximal  risk, 

R  ( q  )  ,  where: 

R*(q)  -  rain  ^A+Xc)  ( 

X  ,  X  >0  ^  “ 

1  ’  a 

T  (  0)  <0 

a  n  d 

1(0)  -  J  C(0-  q(.v))  Pyy(/7)  d/7  -  x^  -  X^p(0)  ( 

N 

The  same  restrictions  apply  to  the  distortion  and 


constraint  functions,  d(0,q(x'))  and  p(9)  as  before. 


X  (X  I  -  max 
^  ^  0 


i  J  " 

'■  N 


C(0,q(,v))  P^(«)  ci/)  -  X  p(  0) 


and  the  mini  max  risk  quantizer  q  is  specified  ljy  a 


procedure  analogu\js  to  that  of  the  previous  section. 


Figures  7  ctirough  10  show  the  effects  due  to  the 


constraint  on  the  noise  power  and  the  distortion  bound  L 


on  tlie  function  1(0).  These  appear  roughly  similar  to 


I(s)  in  [1],  differing  through  the  knowledge  of  the  noise 


for  1(0). 


Note  that  in  the  noiseless  case,  this  risk 


definition  coincides  precisely  with  the  distortion 


function  of  an  unknown  input  sequence  used  by  Bath  and 


VandeLinde.  The  minimax  risk  quantizer  then  is  identical 


to  the  minimax  quantizer  developed  there  for  the  classical 


quantizer  distortion  measure. 


V .  Results  and  Conclusions 


The  minimax  risk  quantizer  determination  procedure, 


13  described  by  the  following  algorithm.  No  attempt  has 


been  made  to  optimize  the  computation  procedure 


1)  Select  X  iO. 


2)  Determine  the  optimal  X^  for  the  specified 


3)  Repeal  step  2)  over  all  >^2  -  0  to  determine 

the  minimum  of  (Xj^(X2)  +  ^2  •  This  is  the 

maximum  risk  for  the  quantizer  q:  R  tq). 

4)  Repeat  steps  l)-3)  over  all  quantizers 
q  6  <3|^  to  determine  the  minimum  of  the 

maximum  risks;  R  =  min  R  (q). 

5)  The  miniraax  risk  quantizer  is  that 
quantizer  q  which  yields  the  minimax  risk: 

R*(q*)  =  R*. 

It  appears  from  the  plots  of  I(;7)  that  the  areas  of 
maxima  occur  at  or  within  a  ^  -  neighborhood  of  the 
quantizer  thresholds.  A  simulation  has  been  performed 
based  on  this  observation  with  the  results  appearing  in 
tables  1  and  2.  These  results  were  determined  by  sampling 
I ( n)  in  the  region  surrounding  each  threshold.  The 
quantizer  step  size  was  fixed  at  0.01  and  the  accuracy  of 
the  Lagrange  minimization  is  such  that  the  error  in  the 

-7 

estimate  is  at  most  1  x  10  .  The  minimax  risk  quantizer 

with  specific  quantizer  thresholds  were  found,  and  the 
minimum  of  these  selected  as  the  minimax  risk  quantizer. 

Comparisons  of  the  rainimax  risk  quantizers  to  the 


Max-I,  loyd  and  minimum  risk  quantizers  (G.S.+G.N.)  are 

shown  at,  thi.-  bottom  of  tables  1  and  2.  Tlu?  mean  squaie 
power  constraint,  c,  is  fixed  such  that  for  the  Gaussian 
noise  case,  the  Signal-lo-Noise  Rat.  io  (SNR)  is  b  dB.  This 
value  is  shown  in  table  1.  At  8  dB ,  the  thresholds  of  all 
three  are  nearly  identical,  with  the  levels  for  the 
minimum  risk  quantizer  inside  (closer  to  the  mean)  those 
of  the  Max-Lloyd  quantizer,  and  the  minimax  risk  quantizer 
levels  just  slightly  inside  those  of  the  minimum  risk 
quantizer.  The  constraint  c  is  fixed  for  table  2  so  that, 
for  the  Gaussian  noise  case,  the  SNR  is  4  dB .  At  4  dB , 
the  outer  threshold  of  the  minimax  risk  quantizer  has 
migrated  well  outside  of  the  other  two  quantizers,  which 
may  be  from  the  Max-Lloyd  and  minimum  risk  quantizers 
assumption  of  Gaussian  densities,  while  the  minimax  risk 
quantizer  only  considers  constrained  noise  power.  The 
levels  of  the  rainimax  risk  quantizer  are  now  between  those 
of  the  Max-Lloyd  and  minimum  risk  quantizers.  This  may  be 
interpreted  from  the  standpoint  of  the  assumptions  also. 
The  minimum  risk  quantizer  assumes  both  the  source  and 
noise  are  Gaussian,  so  it  arrives  at  closely  compacted 
estimates  for  the  levels.  The  minimax  risk  quantizer,  on 
the  other  hand,  only  assumes  the  source  is  Gaussian  for 
this  example,  and  the  noise  power  constrained  --  the^refore 
the  estimates  are  outside  those  of  the  minimum  risk 


quantizer  (for  tliis  example).  The  rainimax  risk  quantizer 


compensates  for  the  corrupting  influence  of  the  noise, 
whereas  the  Max-Lloyd  quantizer  does  not.  For  this 
reason,  the  estimates  for  the  quantizer  levels  of  the 
minimax  risk  quantizer  are  then  placed  within  those  of  the 
Max-Moyd  quantizer. 
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TABLE  1 


Minimax  Quantizer  Determination 


(  N  =  4, 

Squared- 

Error-  Cost, 

Pg  ~  N(0.  1)  ) 

-  0, 

L  =  0.25, 

SNR  =  8dB 

c  =  0  .  1585 

yo 

•’"1 

Miniraax-H  isk 

0 . 90 

0.41 

1.23 

0 . 09931 

0.91 

0.41 

1.24 

0 . 09854 

0 . 92 

0.41 

1.24 

0 . 09780 

0.93 

0.41 

1 . 24 

0.09708 

0.94 

0.41 

1.24 

0 . 09638 

0.95 

0,42 

1.24 

0.09571  < - 

0.96 

0 . 42 

1 . 28 

0 . 09743 

0.97 

0 .42 

1.31 

0.09921 

0.98 

0.41 

1.31 

0 . 09875 

0.99 

0.41 

1.31 

0.09818 

1 . 00 

0.41 

1.31 

0 . 09762 

1.01 

0,41 

1.32 

0.09709 

1 . 02 

0.41 

1.32 

0 . 09657 

1.03 

0,41 

1.32 

0.09606 

1 . 04 

0.41 

1.36 

0.09807 

1.05 

0.41 

1.39 

0.09973 

Max- 

L 1  oyd 

Minimum 

Risk* 

Min imax 

Risk 

yj 

•^i 

yj 

0 

0.4874 

0 

0.4774 

0 

0 . 42 

. .  056 

1 . 625 

1.056 

1.298 

0.95 

1 . 24 

^Minimum  Risk  for  Gaussian  Sources. 


TABLE  2 

Minim  ax  Quant,  izer  Determination 
(N  =  4,  Squared-Error  Cost,  ~  N(0,ln 

=  0.  L  -  0.25,  SNR  =  4dB  ^  c  ^  0.39H 
"  •'  M  in  i  max  -  li  i  s  k 


1 . 27 

0.41 

1.51 

0 .  12232 

1 . 28 

0.41 

1 . 53 

0 . 12372 

1 . 29 

0.41 

1.56 

0 . 12500 

1 . 30 

0.41 

1.56 

0. 12447 

1.31 

0.41 

1 . 56 

0 . 12395 

1 . 32 

0.41 

1 . 57 

0. 12343 

1 . 33 

0.41 

1 . 57 

0 . 12294 

1 . 34 

0.41 

1 . 57 

0. 12254 

1 . 35 

0.41 

1 . 57 

0.12198 

1.36 

0.41 

1 . 58 

0. 12348 

1 . 37 

0.41 

1 . 60 

0. 12492 

1 . 38 

0.41 

1 . 60 

0.  12450 

1 .  39 

0.41 

1.60 

0. 12408 

1.40 

0.41 

1.60 

0 . 12367 

1.41 

0.41 

1.61 

0. 12326 

Max -Lloyd 

Min iraum 

1  Risk* 

Min i max  Risk 

0  0.5354  0  0.3830  0  0.41 

1.160  1.705  1.160  1.277  1.35  1.57 


Minimum  Risk  for  Gaussian  Sources. 
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